From Documents to Dialogue: A step-by-step RAG Journey
dev.toΒ·8hΒ·
Discuss: DEV
πŸ“ŠMulti-vector RAG
Welcome to LIL’s Data.gov Archive Search
lil.law.harvard.eduΒ·2h
πŸ’ΎData Preservation
DupeGuru lets you quickly find and remove duplicate files from your drives
techspot.comΒ·1d
πŸ”„Content Deduplication
Efficient and accurate search in petabase-scale sequence repositories
nature.comΒ·2dΒ·
Discuss: Hacker News
πŸ”„Burrows-Wheeler
A Fuzzy Logic-Based Framework for Explainable Machine Learning in Big Data Analytics
arxiv.orgΒ·2d
🧠Machine Learning
Unreliable library of human knowledge
flowingdata.comΒ·1d
πŸ“°Content Curation
Paper2Agent: Research Papers as Interactive AI Agents
huggingface.coΒ·6hΒ·
Discuss: Hacker News
πŸ€–AI Curation
The Dunhuang Culture ζ•¦η…Œζ–‡εŒ– Database
digitalorientalist.comΒ·9h
πŸ“œText Collation
Automated Copyright Infringement Detection via Semantic Fingerprinting and Dynamic Thresholding
dev.toΒ·1dΒ·
Discuss: DEV
πŸ‘οΈPerceptual Hashing
Doing Math with Embeddings for Better AI Ad Targeting
ethicalads.ioΒ·1dΒ·
Discuss: Hacker News
πŸ“ŠFeed Optimization
An enough week
blog.mitrichev.chΒ·1dΒ·
πŸ“ˆLinear programming
Retentive Relevance: Capturing Long-Term User Value in Recommendation Systems
arxiv.orgΒ·18h
🎯Content Recommendation
Homomorphism Problems in Graph Databases and Automatic Structures
arxiv.orgΒ·18h
πŸ”—Graph Isomorphism
YouTube gets ~5% CTR lift on Shorts by replacing embedding tables with Semantic IDs
shaped.aiΒ·22h
πŸ“ŠFeed Optimization
Show HN: Lore Engine – Turn 10-hour lectures into 2 hours of comprehensive notes
github.comΒ·1dΒ·
Discuss: Hacker News
πŸ“„Document Streaming
Offensive OSINT s05e10 - Interactive investigative stories part 1
offensiveosint.ioΒ·2d
🌐WARC Forensics
Show HN: Rebuilt Bible search app to run 100% client-side with Transformers.js
biblos.appΒ·46mΒ·
Discuss: Hacker News
πŸ“œBinary Philology
The Complete Guide to Building High-Quality Backlinks in 2025
dev.toΒ·11hΒ·
Discuss: DEV
🧭Content Discovery
Nearest Neighbor CCP-Based Molecular Sequence Analysis
arxiv.orgΒ·18h
πŸ”„Burrows-Wheeler
Show HN: Comparegpt.io – Trustworthy Mode to reduce LLM hallucinations
news.ycombinator.comΒ·21hΒ·
Discuss: Hacker News
πŸ”BitFunnel